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(54) Title: REVERSE TWO-HYBRID MEIHOD 



(57) Abstract 



The reverse two-hybrid method has been designed to provide a practical and efficient means of utilizmg yeast cell-based assays to 
screen for molecules that can inhibit protein-protein interactions of interest Existing two-hybrid systems involve reconstitutlon in yeast of 
a transcriptional activator that drives expression of a '*rcpoiter" gene such as HISS or lacZ. Attei^ts to utilize these existing systems for 
drug discoveiy would necessarily involve screening for molecules that interfere with the transcriptional read-out, and would be subject to 
detecting any compound that non-specifically mteifered with transcription. In addition, since currently used reporter genes encode long-lived 
proteins, the assay would have to be perfomed over a lengthy time period to allow for decay of the preexisting r^>arter proteins. Any 
compound that would be toxic to yeast over this time period would also score as a '*hit". The reverse two-hybrid interaction will avoid 
both of these pitfalls by driving the expression of a relay gene, such as the GAL80 gene, which encodes a protein that binds to and masks 
the activation domain of a transcriptional activator, such as Gal4w The reporter genes, which will provide the transcriptional read-out {HIS3 
or lasZ)^ are dependent upon functional Gal4for expression. Only when the level of GalSO masking protein is reduced by interfering with 
the two-hybrid interaction will Gal4 f^mction as a transcriptional activator, providing a positive transcriptional read-out for molecules that 
inhibit the two-hybrid protein-protein interaction. An important feature of the reverse two-hybrid system is that the basal level and half-life 
of the relay protein, GalSO, can be fine-tuned to provide maximum sensitivity. 
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REVERSE TWO-HYBRID METHOD 

5 

TECHNICAL FIELD 

The invention relates to methods and compositions for identifying agents 
which modify intermolecular association between two or more polypeptides. Agents which 
10 specifically inhibit such protein-protein interactions are suitable for use as conunercial 

reagents, pharmaceuticals, and for nKxiulating gene expression in a cell culture and/or animal, 
such as to increase or decrease the expression of a predetermined protein in the cell culture or 
animal, and the like. 

15 BACKGROUND 

Specific protein-protein interactions are fundamental to most cellular and 
organismal functions. Polypq)tide interactions are involved in formation of funaional 
transcription complexes, signal transduction pathways, cytoskdetal organization (e.g., 
microtubule polymerization), polypeptide hormone receptor-ligand binding, organization of 

20 multi-subunit enzyme complexes, and die like. 

Investigation of protein-{)rotein interactions under physiological conditions has 
been problematic. Considerable effort has been made to identify proteins that bind to proteins 
of interest. Typically, these interactions have been detected by using co-precipitation 
experiments in which an antibody to a known protein is mixed with a cell extract and used to 

25 precipitate the known protein and any proteins which are stably associated with it. This 

method has several disadvantages, such as: (1) it only detects proteins whicli are associated in 
cell extract conditions rather than under physiological, intracellular conditions, (2) it only 
detects proteins which bind to the known protein with sufficient strength and stability for 
efficient co-immunoprecipitation, and (3) it fails to detect associated proteins which are 

30 displaced firom the known protein upon antibody binding. For these reasons and others, 
improved methods for identifying proteins which interact with a known protein have been 
developed. 

Twp-|Iy|?rid Systemfj 
One approach has been to use a so-called "two-hybrid" system to identify 
35 polyp^tide sequences which bind to a predetermined polypq)tide sequence present in a fusion 
protein (Chien et al. (1991) Proc. Natl. Aca d. Sci. aJSA> S&: 9578). TTiis approach identifies 
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protein-protein interactions io vivo tiirough reconstitution of a transcriptional activator (Fields 
S and Song O (1989) Nature 24Q: 245), the yeast Gal4 transcription protein. The mediod is 
based on the properties of the yeast Gal4 protein, which consists of separable domains 
responsible for DNA-bindmg and transcriptional activation. Polynucleotides encoding two 
5 hybrid proteins, one consisting of the yeast Gal4 DNA-binding domain fused to a polypeptide 
sequence of a known protem and the other consisting of the Gal4 activation domain fused to a 
polypeptide sequence of a second protein, are constructed and introduced into a yeast host 
cell. Intermolecular binding between the two fusion proteins reconstitutes the Gal4 DNA- 
binding domain with the Gal4 activation domain, which leads to the transcriptional activation 
10 of a reporter gene (e.g., lacZ^ HISS) which is operably linked to a Gal4 binding site. 
Typically, the two-hybrid method is used to identify novel polypeptide sequences which 
interact with a known protein (Silver SC and Hunt SW (1993) Mol. Biol. Rep. 17: 155; 
Durfee et al. (1993) Genes Devel. 2; 555; Yang et al. (1992) Science 257 : 680; Luban et al. 
(1993) Cell 2a: 1067; Hardy et al. (1992) Genes DeveK fi; 801; Bartel et al. (1993) 
15 Piotfighniqyw 14: 920; and Vojtek et al. (1993) £^ 74: 205), However, variations of the 
two-hybrid method have been used to identify mutations of a known protein that affect its 
bmding to a second known protein (Li B and Fields S (1993) FASEB J. J: 957; Lalo et al. 
(1993) Proc. Nad. Acad, Sci. fUSA^ 2Q: 5524; Jackson et al. (1993) MoL Cell. Biol. 13; 
2899; and Madura al. (1993) J. Biol. Chem. 268 : 12046). Two-hybrid systems have also 
20 been used to identify mteracting structural domains of two known proteins (Bardwell et al. 
(1993) pied, Micrpbjol, fi: 1177; Chakraborty et al. (1992) J. Biol. Chem. 262: 17498; 
Staudinger et al. (1993) J. Biol. Chem. 268: 4608; and MUne GT and Weaver DT (1993) 
Qenes Dgv^l, 2; 1755) or domains responsible for oligomerization of a single protein 
awabuchi et al. (1993) Oncogene g; 1693; Bogerd et al. (1993) J. Virol. £7: 5030). 
25 Variations of two-hybrid systems have been used to study the m vivo activity of a proteolytic 
enzyme (Dasmahapatra et al. (1992) Proc. Natl, Acad. Sci. fUSA^ 89: 4159). Alternatively, 
an E. coli/BCCP interactive screening system (Germino et al, (1993) Proc. Natl. Acad. Sgi, 
OLSjL} 9Q: 933; Guarente L (1993) Proc. Natl. Aca d. Sci. m S A -fc gO: 1639) can be used 
to identify interacting protein sequences (i.e., protein sequences which heterodunerize or form 
30 higher order heteromultimers). 

Each of these two-hybrid methods rely upon a positive association between 
two Gal4 fusion proteins thereby reconstituting a functional Gal4 transcriptional activator 
which then induces transcription of a reporter gene operably linked to a GaI4 binding site. 
Transcription of the reporter gene produces a positive readout, typically manifested either (1) 
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as an enzyme activity (e.g., /3-gaiactosidase) that can be identified by a colorimetric enzyme 
assay or (2) as enhanced cell growth on a defined medium (e.g., HISS). Thus, these methods 
are suited for identifying a positive interaction of polypq)tide sequences, but are poorly suited 
for identifying agents or conditions which alter (e.g., inhibit) intermolecular association 
5 between two polypeptide sequences. In part, this is because a failure to obtain 

expression of the rq)orter gene can result from many events which do not stem from a 
specific inhibition of binding of the two hybrid proteins. For example, a two-hybrid system 
using a reporter gene that stimulates growth under defined conditions theoretically can be used 
to screen for agents that inhibit the intermolecular association of the two hybrid proteins, but 

10 it will be difficult or impossible to discriminate agents that specifically inhibit the association 
of the two hybrid proteins from agents which simply inhibit cell growth. Hius, an agent 
which is cytotoxic to yeast (e.g., bleach, phenol, ketoconazole, cycloheximide) will prevent 
cell growth without specifically inhibiting the interaction of two hybrid proteins and will score 
falsely as a positive hit. Similarly, a conventional two^ybrid system using a lacZ reporter 

15 gene will wisely score general transcription or translation inhibitors (e.g., cycloheximide) as 
being inhibitors of two hybrid protein binding. Thus, two-hybrid systems that produce a 
positive readout contingent upon intermolecular binding of the two hybrid proteins are 
generally not suitable for screening for agents which inhibit binding of the two hybrid 
proteins. 

20 Unfortunately, it would be desirable to have an efficient screening method for 

identifying compounds which specifically alter the intermolecular association between two 
known polypeptide sequences undw physiological conditions. Present two-hybrid methods 
rely on a positive readout and do not afford a m^od for identifying binding inhibitors (or 
binding competitors) with satisfactory sensitivity and/or selectivity. 

25 Thus, there is a need in the art for compositions and methods which can be 

used to efficiently identify agents that specifically alter the intermolecular association between 
two polyp^tide sequences in yim* The present invention fiilfills these and other needs. 

The references discussed herem are provided solely for their disclosure prior 
to the filing date of the present application. Nothing herein is to be construed as an admission 

30 that the inventors are not entitled to antedate such disclosure by virtue of prior invention. All 
publications and patent applications herem are incorporated by reference to Ae same extent as 
if each individual publication or patent application was specifically and individually mdicated 
to be incorporated by reference. 
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SUMMARY PF Tff^ INVSNnON 

The present invention provides several novel methods and compositions for 
identifying agents which alter intermolecular binding between two polypeptide species m a cell 
or in a cell-free transcription reaction. The invention relates to a general method, referred to 
5 herein as a reverse two-hybrid method, wherein agents which disrupt an intermolecular 
association between two interacting polypeptides thereby generate a selectable and/or 
detectable readout (e.g., complementation of an auxotrophic phenotype, expression of a 
detectable reporter molecule, and the like). Typically, a reverse two-hybrid mediod produces 
a positive readout under conditions wherein an agent blocks or otherwise inhibits the 

10 intermolecular binding of the interacting polypeptides. A positive readout condition is 
generally identified as one or more of the following detectable conditions: (1) an increased 
transcription rate of a predetermined reporter gene, (2) an increased concentration or 
abundance of a polypeptide product encoded by a predetermined rq>orter gme, typically such 
as an enzyme which can be readily assayed ^ vivo , and/or (3) a selectable or otherwise 

15 identifiable phenotypic change in an organism (e.g., yeast) harboring the reverse two-hybrid 
system. Generally, a selectable or otherwise identifiable phenoQrpic change diat diaracterizes 
a positive readout condition confers upon die organism either: a selective growth advantage on 
a defined medium, a mating phenotype, a characteristic morphology or developmental stage, 
drug resistance, or a detectable enzymatic activity (e.g., /}-galactosidase, luciferase, alkaline 

20 phosphatase, and the like). In this manner, it is possible to efficiently identify agents 
(including but not limited to polyp^tides, small molecules, and oligonucleotides) which 
inhibit intermolecular binding between two predetermined interacting polypeptides. 

In an aspect of the invention, a reverse two-hybrid system is composed of: (1) 
a first hybrid protein, ^) a second hybrid protein which binds to the first hybrid protein under 

25 control conditions (e.g.,. physiological conditions in the absence of agent), (3) a relay (or 

signal inverter) gene which is efficiently expressed as a consequence of the first hybrid protein 
and the second hybrid protein being functionally bound to each other, and (4) a r^orter gene 
which is efficiently expressed when the product of the relay (or signal inverter) gene is 
substantially absent and is either poorly expressed or not expressed when the relay (or signal 

30 inverter) gene product is efficiently expressed. The first hybrid protein and second hybrid 
protein bind to each other through interacting polypeptide segments (i.e., a portion of the first 
hybrid protein preferentially binds to a portion of the second hybrid protein forming a 
heterodimer or higher order heteromultimer comprising the first and second hybrid proteins; 
said binding portions of each hybrid protein are termed "interacting polypeptide segments"). 
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The first hybrid protein comprises: (1) a first interacting polypeptide sequence 
in polypq)tide linlcage with (2) a DNA-binding domain of a transcriptional activator protein or 
other DNA binding protein (e.g., a repressor). The second hybrid protein comprises: (1) a 
second interacting polypq)tide sequence, enable of fbrming an intennolecular association with 
5 the first interacting polypeptide sequence under control conditions (e.g., physiological 
conditions and absence of agent) in polypeptide linkage with Q) an activation domain of a 
transcriptional activator protein, whereby intennolecular binding between the first hybrid 
protein and the second hybrid protein (via the interacting polypeptide sequences) thereby 
unit^ the DNA-binding domain of the first hybrid protein widi the activation domain of the 

10 second generating a transcriptional activator function. Genially, the first hybrid protein and 
the second hybrid protem are encoded by polynucleotides which are constitutively expressed in 
a host organism (e.g., a eukaryotic or prokaryotic cell, or multicellular organism). 

The relay gene (alternatively t^med the signal inverter gene) is operably 
linked to a transcriptional regulatory sequence (a "relay transcriptional regulatory sequence") 

IS which is positively regulated by the transcriptional activator that is formed by the 

intermolecular binding of the first hybrid protein to the second hybrid protein. Hence, when 
the first hybrid protein binds to the second hybrid protein (via the interacting polypq)tide 
sequences), the transcriptional activator formed th^eby bmds to a transcriptional regulatory 
sequence operably linked to the relay gene and enhances the net transcription of the relay 

20 gene. The relay gene encodes a protein that r^resses transcription of a rqwrter gene. Thus, 
when the first and second hybrid protems are functionally bound to each other, the relay gene 
is expressed and thereby represses transcription of the reporter gene(s). In an embodiment, 
such relay proteins are of the type often referred to m the art as "negative regulators of 
transcription". In an embodiment of the invention, the relay gene is a negative regulator of 

25 transcription in yeast; for example but not limitation the GAL80 gene can serve as a relay gene 
in yeast. In embodiments where host organisms are employed to harbor the reverse two- 
hybrid system, the relay gene is often a gene which naturally occurs in the germline DNA of 
the host organism species, and frequently can be an endogenous germline gene, or 
alternatively may be introduced into the host organism as exogenous DNA, typically into a 

30 host genome that lacks the corresponding functional endogenous gene (e.g., a "knockout 
background"). 

The reporter gene is operably linked to a transcriptional regulatory sequence 
("reporter transcriptional regulatory sequence") which is negatively regulated by the gene 
product of the relay gene and which is induced in the absence of the relay gene product. 
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Thus, transcription of the reporter gene is repressed in control conditions (e.g., physiological 
conditions in the absence of agent) wherein the two hybrid proteins bind to each other and 
form a transcriptional activator that increases transcription of the relay gene. Generally, the 
relay gene product either binds to the transcriptional regulatory sequence operably linked to 

5 the reporter gene, or binds to a transcription protein that binds to the transcriptional regulatory 
sequence operably linked to the reporter gene. The net transcription rate of the reporter gene 
is reduced (or completely blocked) as a consequence of the relay gene product binding to the 
rqwrter gene transcriptional regulatory sequence and/or to a transcription protein required for 
constitutive expression of the rq>orter gene. Any of a variety of rqwrter genes that produce a 

10 positive readout can be used. For example and not limitation, suitable reporter genes are 
those which (1) confer a selectable phraotype to cells m which the reporter gene is efficiendy 
expressed, and/or (2) encode a gene product (e.g., enzyme) which is convenientiy detected 
such as by is ^ assay or the like. Suitable genes which confer a selectable phenotype are 
exemplified by, but not limited to, genes which complement auxotrophic mutations in a host 

15 organism (e.g., yeast fflSJ), genes which encode drug resistance (e.g., ne</), genes which 
induce cell proliferation, and other genes whose expression confers a selective growth 
advantage. Suitable genes which encode a gene product whidi is convenientiy detected 1q sj^ 
are exemplified by, but not limited to, /3-gaIactosidase (e.g., soli iacZ), luciferase, alkaline 
phosphatase, horseradish peroxidase, and the like. 

20 The invention provides polynucleotides encoding a first hybrid protein and a 

second hybrid protein. Such polynucleotides encode a DNA-binding domain or activation 
domain of a transcriptional activator and convenientiy can have a cloning site for adjacent 
insertion, in reading frame, of polynucleotide sequences encoding one or more interacting 
polypeptide sequence(s). Typically, a first polynucleotide wUl encode a first hybrid protein 

25 composed of a first predetermined interacting polypeptide sequence and a DNA-binding 
domain of a transcriptional activator; a second polynucleotide will encode a second hybrid 
protein composed of a second predetermined interacting polypeptide sequence and an 
activation domain of a transcriptional activator, wherein the DNA-binding domain of the first 
hybrid protein can reconstitute with the activation domain and form a fiinaional transcriptional 

30 activator. Often, the DNA-binding domain and the activation domain of the hybrid protein 
pair are derived from the same naturally occurring transcription activator (e.g., Gal4). 
However, those of skill in the art can select DNA-binding domains and activation domains 
from distinct transcription activators which can reconstitute to form a functional transcriptional 
activator which does not occur in nature (e.g., a DNA-binding domain of the bacterial lexA 
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protein caa be used in conjunction with a transcriptional activator from the viral protein, 
VP 16; Vojtek et al. (1993) op.cit.) . Transcription and translation of such a polynucleotide 
produces a hybrid (or fusion) protein composed of an interacting polynucleotide segment and a 
DNA-binding domain or activation domain of a transcriptional activator. 
S The invention also provides polynucleotides which comprise a transcriptional 

regulatory sequence operably Imked to a relay (or signal inverter) gene. A relay (or signal 
inverter) gene encodes a protein that inhibits or otherwise rq)resses expression (typically 
transcription) of a pred^ermined rq)orter gene. Most usually, a relay protem is a negative 
regulator of transcription for a predetermined gene or gene subset. In an embodunent, the 

10 relay protein is a transcription repressor protein that bmds to a polynucleotide sequence and 
th^eby inhibits transcription of a cis-linked and operably linked sequence. In an alternative 
embodiment, the relay protein binds to a protein that is a positive regulator of transcription of 
a predetermined gene or gene subset, and as a consequence of binding thereby inhibits the 
transcriptional activity of the positive regulator. One variety of snch a relay protein binds to 

15 and blocks the activation domain(s) of transcriptional activators. Although a variety of 
suitable relay proteins are apparent to those of skill in the art, tiiis category of relay protein 
can be exemplified by the mammalian mdm2 oncoprotein which binds tiie transactivation 
domain of the tumor suppressor protein p55, and the yeast GalSO protein which binds and 
inactivates the activation domam of Gal4. In an embodiment, the relay protein comprises a 

20 mutation, addition, or deletion that reduces the stability of the relay protein in vivo as 

compared to the naturally occurring cognate relay protein. Relay proteins can be referred to 
as signal inverter proteins, as they serve to invert a positive transcriptional signal (the 
reconstitution of a functional transcriptional activator by binding of the two hybrid proteins) 
into a negative transcriptional signal, which reduces transcription of a pred^ermmed reporter 

25 gene. Generally, a polynucleotide encoding a relay proteui is operably linked to a relay 
transcriptional regulatory sequence that produces transcription of the relay gene dependent 
upon functional reconstitution of the DNA-binding domain and activation domain of the two 
hybrid proteins. For example and not limitation, such a relay transcriptional regulatory 
sequence can comprise a promoter and a polynucleotide sequence comprising one or more 

30 site(s) which bind(s) a reconstituted functional transcriptional activator formed by association 
of the two hybrid proteins; for example, if the two hybrid transcriptional activator comprises a 
lexA DNA-binding domain, the relay transcriptional regulatory sequence operably Imked to the 
relay gene can comprise one or more lexA binding site sequences, arrayed in tandem. 

The invention also provides polynucleotides which comprise a transcriptional 
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regulatory sequence operably linked to a reporter gene. TTie reporter gene encodes a protein 
that confisrs a selectable phenotype on a host cell and/or can be detected by an in vivo assay, 
such as an ia siju enzymatic assay (e.g., host cells expressing a lacZ reporter can be detected 
as blue staining cells in the presence of X-gal). A transcriptional regulatory sequence 
5 operably linked to the reporter gene comprises a promoter and generally produces constitutive 
transcription of the relay gene contingent upon the substantial absence of the relay protein. In 
an embodiment, the rqwrter transcriptional regulatory sequence operably linked to the 
rq)orter gene comprises a binding site for a rday protein, wherein binding of die relay protein 
to the reporter transcriptional regulatory sequence inhibits constitutive transcription of the 
10 rq)orter gene. In an altwnative embodiment, the rqwrter transcriptional regulatory sequence 
linked to the rqwrter gene comprises a binding site for a transCTiptional activator protein, 
wherein binding of a constitutive transcriptional activator protein to the rqwrter transcriptional 
regulatory sequence produces constiftitive transcription of the cis-Iinked rqwrter gene, and 
wherein the relay protein binds to or otherwise inactivates the transcriptional activator protein, 
15 thereby rq)ressing constitutive egression of the rqwrter gene. 

TTie invention also provides host organisms (typically unicdiular organisms) 
which harbor a reverse two-hybrid system, typically in the form of polynucleotides encoding a 
first hybrid protein, a second hybrid protein, a relay gene, and/or a reporter gene, wherein 
said poIynucleotide(s) are either stably replicated or introduced for transient expression. In an 
20 embodiment, the host organism is a yeast cell (e.g., Saccharomvces cervisiae^ m which the 
germline GAL80 gene is functionally inactivated, the relay gene encodes Gal80, and the 
rq)ortCT gene transcriptional r^atoiy sequence comprises a Gal4-responsive promoter. 

The invention also provides a method for Identifying agents that inhibit 
binding of a first interacting polypeptide to a second interacting polypeptide. The method 
25 employs the reverse two-hybrid system described sig>ra, wherein a first hybrid protein 
comprises the first interacting polypeptide and a second hybrid protein comprises a second 
interacting polypeptide. Heterodimerization (or higher order heteromultimerization) between 
the first hybrid protein and the second hybrid protein produces transcription of a relay gene 
encoding a protein which inhibits expression of a reporter protein. Host organisms harboring 
30 such a reverse two-hybrid system are culmred in the presence of an agent, such as a diffusible 
small molecule (typical MW < 5,000, preferably < 1.000) or a transfected cDNA expression 
polynucleotide encoding a polypeptide agent, and expression of die host organism reporter 
gene is determined and standardized to a parallel blank culture which lacks an agent. Agents 
which produce a gnificam increase in expression of die reporter gene in a host organism 
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after a suitable time period (e.g., usually at least 1 hour, often at least 3 hours, preferably 
about 6 hours, occaisionally overnight or longer) are thereby identified as inhibitors for 
blocking the intennolecular association between the first an second interacting polypeptide 
sequences. Such protein interaction inhibitors are candidate drugs for pharmaceutical use 
S and/or for use as conmiercial research reagents. In an embodiment of the invention, yeast 
cells are the host organism, the reporter gene encodes ^-galactosidase and/or a protein that 
complements an auxotrophic mutant yeast host cell, and the first and second interacting 
polypeptide sequences each comprise a binding domain derived from a signal transduction 
protein. 

10 The invention also provides a kit comprising a reverse two-hybrid system, a 

host cell, and an instruction manual. Such kits may optionally include a panel of agents for 
testing. 



DETAILED DESCWnON 

15 Definitions 

Unless defined otherwise, all technical and scientific terms \ised herein have 
the same meaning as conunonly understood by one of ordinary skill in the art to which this 
invration belongs. Although any methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, the prefierred 

20 methods and materials are described. For purposes of the present invention, the following 
terms are defined below. 

In the polypeptide notation used herein, the lefthand direction is the amino 
terminal direction and the righthand direction is the carboxy-terminal direction, in accordance 
with standard usage and convention. Similarly, unless specified otherwise, the lefthand end of 

25 single-stranded polynucltotide sequences is the 5' end; the lefthand direction of double- 
stranded polynucleotide sequences is referred to as the 5* direction. The direaion of 5* to 3' 
addition of nascent RNA transcripts is referred to as the transcription direction; sequence 
regions on the DNA strand having the same sequence as the RNA and which are S* to the 5' 
end of the RNA transcript are referred to as "upstream sequences"; sequence regions on the 

30 DNA strand having the same sequence as the RNA and which are 3' to the 3' end of the RNA 
transcript are referred to as "downstream sequences". 

The term "naturally-occurring" as used herein as applied to an object refers to 
the fact that an object can be found in nature. For example, a polypeptide or polynucleotide 
sequence that is present in an organism (including viruses) that can be isolated from a source 
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in nature and whidi has not been intentionally modified by man in the laboratory is naturally- 
occurring. 

The term "agent" is used herein to denote a chemical compound, a mixture of 
chemical compounds, a biological macromolecule, or an extract made from biological 
5 materials such as bacteria, plants, fiingi, or animal (particularly manunalian) cells or tissues. 
Agents are evaluated for potential activity as specific protein interaction inhibitors (i.e., an 
agent which selectively mhibits a binding interaction between two predetermined polypeptides 
but which does not substantially interfere with cell viability) by inclusion in screening assays 
described hereinbelow. 

10 The term "protein interaction inhibitor" is used herein to refer to an agent 

which is identified by one or more screening method(s) of the invention as an agent which 
selectively mhibits protein-protein binding between a first interacting polypeptide and a second 
interacting polypq)tide. Some protein interaction inhibitors may have ther^)«itic potential as 
drugs for human use and/or may serve as conmiercial reagents for laboratory research or 

15 bioprocess control. Protein interaction inhibitors which are candidate drugs are then tested 
further for activity in assays which are routinely used to predict suitability for use as human 
and veterinary drugs, including ia vivo administration to non-human animals and often 
including administration to human in approved clinical trials. 

As used herein, the term "operably linked" refers to a Imkage of 

20 polynucleotide elements in a functional relationship. A nucleic acid is "operably linked" when 
it is placed into a functional relationship with another nucleic acid sequence. For instance, a 
promoter or enhancer is operably linked to a coding sequence if it affects the transcription of 
the coding sequence. Operably linked means that the DNA sequences bemg linked are 
typically contiguous and, where necessary to join two protein coding regions, contiguous and 

25 in reading frame. However, since enhancers generally function when separated from the 
promoter by several kilobases and intronic sequences may be of variable lengths, some 
polynucleotide elements may be operably linked but not contiguous. 

As used herein, the term "endogenous DNA sequence" refers to naturally- 
30 occurring polynucleotide sequences contained in a eukaryotic or prokaryotic cell. Such 
sequences include, for example, chromosomal sequences (e.g., structural genes, promoters, 
enhancers, recombinatorial hotspots, repeat sequences, integrated proviral sequences). A 
"predetermined sequence" is a sequence which is selected at the discretion of the practitioner 
on the basis of known or predicted sequence information. An exogenous polynucleotide is a 
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polynucleotide which is transferred into a eukaryotic or prokaryotic cell. 

As used herein the term "physiological conditions" refers to temperature, pH, 
ionic strength, viscosity, and like biochemical parameters which are compatible with a viable 
organism, and/or which typically exist intracellularly in a viable cultured yeast cell or 
5 mammalian cell. For example, the intracellular conditions in a yeast cell grown imder typical 
laboratory culture conditions are physiological conditions. Suitable in vitro reartion conditions 
for io vitro transcription cocktails are generally physiological conditions. In general, Iq vitro 
physiological conditions comprise 50-200 mM NaCl or KCl, pH 6.5-8.5, 20-45**C and 0.01- 
10 mM divalent cation (e.g., Mg+*, Ca**); preferably about 150 mM NaQ or KQ, pH 7.2- 

10 7.6, 5 mM divalent cation, and often include 0.01-1.0 percent nonspecific protein (e.g., 
BSA). Particular aqueous conditions may be selected by the practitioner according to 
conventional methods. For general guidance, the foliowmg buffered aqueous conditions may 
be applicable: 10-250 mM NaQ, 5-50 mM Tris HQ, pH 5-8, with optional addition of 
divalent cation(s) and/or metal chelators and/or nonionic detergents and/or membrane 

15 fractions. 

The terms "functional disruption** or "functionally disrupted** as used herein 
means that a gene locus conq)rises at least one nmtation or structural alteration such that the 
functionally disrupted gene is substantially incapable of directing the efficient expression of 
functional gene producL 

20 As used herein, the terms "interacting polypeptide segment" and "interacting 

polypeptide sequence" refer to a portion of a hybrid protein which can form a specific bmdmg 
interaction with a portion of a second hybrid protem under suitable binding conditions. 
Generally, a portion of the first hybrid protein preferentially binds to a portion of the second 
hybrid protein forming a heterodimer or higher order heteromultimer comprising the first and 

25 second hybrid proteins; the binding portions of each hybrid protein are termed interacting 
polypeptide segments. 

Descriorion of the Invention 

Generally, the nomenclature used hereafter and the laboratory procedures in 
30 cell culture, molecular genetics, and nucleic acid chemistry and cell culture described below 
are those well known and commonly employed in the art. Standard techniques are used for 
recombinant nucleic acid methods, polynucleotide synthesis, and microbial culture and 
transformation (e.g., electroporation, lipofection). Generally enzymatic reactions and 
purification steps are performed according to the manufacturer's specifications. The 
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techniques and procedures are generally performed according to conventional methods in the 
art and various general references generally . Sambrook et al. Molecular Cloning: A 
Laboratory Manual, 2d ed. (1989) Cold Spring Harbor Laboratory Press, Cold Sprmg 
Harbor, N.Y., which is incorporated herein by reference) which are provided throughout this 
5 document. The procedures therem are believed to be well known in the art and are provided 
for the convenience of the reader. All the information contained therein is incorporated herein 
by reference. 

The reverse two-hybrid method is generally applicable for identifying agents 
which inhibit binding bmreen a variety of predetermined interacting polypeptides. 

10 

Overview 

A basis of the present invention is a strategy for screening a bank of agrats 
with a reverse two-hybrid system to identify agents whidi mhibit the intermolecular 
association of two interacting polypeptide sequences. Thus, in a reverse two-hybrid system 
15 there is at least one pair of mteracting polypeptide sequences, with a first interacting 

polypq)tide sequence present in one of the hybrid protein species and a second int^acting 
polypeptide sequence present in the odier hybrid protein species. The choice of interacting 
polypeptide sequences incorporated in a reverse two-hybrid system is selected at tiie discretion 
of the practitioner. For example, a reverse two-hybrid system suitable for idratifying agents 

20 which inhibit Fos/Jun leucine zipper formation may be composed of a first hybrid protein 
having an interacting polypeptide sequence comprising a Fos leucine zipper and a second 
hybrid protem having an interacting polypeptide sequence comprising a Jun leucine zipper. A 
variety of interacting protein sequences can be used; for example and not limitation, these 
include: transcription factor binding domains, multisubunit proteins, signal transduction 

25 proteins (G protems, members of ras/raf/MEK signaling patiiway(s), tumor suppressor protein 
binding domains pS3f), and die like), polyp^tide ligands and tiieir cognate receptor(s), 
active sites of en^mes which catalyze reactions involving binding to a polypeptide substrate 
and the polypeptide substrate itself, and essentially any pair of protein sequences which form 
an intermolecular association under physiological conditions. Generally, interacting 

30 polypeptides form heterodimers with a dissociation constant (Ko) of at least about 1 x 10* M**, 
usually at least 1 x 10* M'*, typically at least 1 x 10» W\ preferably at least 1 x 10* M'^ to 1 
X 10^ M*^ or more, under suitable physiological conditions. 

Reverse two-hybrid systems are used for deteaing tiie ability of agents to 
inhibit the intermolecular binding of two interacting polypeptides and provide for fecile high- 
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throughput screening of agent banks (e.g., compound libraries, peptide expression libraries, 
and the like) to identify protein interaction inhibitors which preferentially inhibit 
intermolecular binding between two predetermined interacting polypeptide species. Such 
protein mteraction inhibitors (specific binding antagonists) can modulate biochemical activity 
5 of the predetermined interacting specie(s) and thereby modulate biological function. Agents 
which alter the intermolecular association of the two interacting polypeptide sequences in the 
hybrid proteins, generally by inhibiting heterodimeric binding of the two hybrid proteins, 
score positively in the reverse two-hybrid system. The protein interaction inhibitors thereby 
identified are candidate drugs for human and v^erinary therapeutic use and/or are suitable 

10 commercial reagents for laboratory researdi or bioprocess control. 

An agent capable of specifically inhibiting protein-protein binding of a 
therapeutically relevant protein interaction ig vivo can be used for th^^y of disease or for 
modulation of gene egression in cells and organisms. Typically, an efiRcacious dose of a 
protein interaction inhibitor is administered to a patient as a ther^)eutic or prophylactic for 

IS treating a pathological condition (e.g., cancer, inflanmiation, lymphoproliferative diseases, 
autoinmmne disease, and die like). 



Description of the Preferred Embodiment 

In ord^ to illustrate the invention, a description of a preferred embodunrat is 

20 presented below. This embodiment comprises a reverse two-hybrid system in yeast cells that 
are functionally disrupted for endogenous GAL80 expression, wherein the mtmnolecular 
association of the first and second hybrid protems activates transcription of a GAL80 relay 
gene. Expression of GAL80 represses the transcriptional activity of constitutively expressed 
Gal4 protein and inhibits transcription of a Gal4-dependent reporter gene. 

25 A variety of alternative embodiments and variations will be apparent to those 

of skill in the art, including alternative relay genes, alternative host cells (e.g., manmialian, 
bacterial, fungal, insect, and the like), variations of the basic reverse two-hybrid method, and 
others. 

30 Two-Hybrid system^ 

Transcriptional activators are proteins that positively reguliate the expression of 
specific genes. They can be functionally dissected into two structural domains: one region 
that binds to specific DNA sequences and thereby confers specificity, and another region 
termed the activation domain that binds to protein components of the basal gene expression 
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madiinery (Ma and Ptashne (1988) £^ 55: 443). These two domains need to be physically 
connected in order to function as a transcriptional activator. Two-hybrid systems exploit this 
finding by hooking up an isolated DNA binding domain to one protein protein X), while 
hooking up the isolated activation domain to another protein (protein Y). When X and Y 
5 interact to a significant extent, the DNA binding and activation domains will now be 
connected and the transcriptional activator function reconstituted (Fields and Song (1989) 
Nature 24Q: 245). The yeast host stram is engineered so that the reconstituted transcriptional 
activator drives the expression of a specific reporter gene such as HIS3 or tocZ, which 
provides the read-out for the protem-protein mteraction (Field and Song (1989) opxit : Chein 

10 et al. (1991) op.cit.) . One advantage of two-hybrid systems for monitoring protein-protein 
interactions is their sensitivity in detection of physically weak, but physiologically uiq)ortant, 
protein-protein interactions. As such it offers a significant advantage over other methods for 
detecting protein-^)rotein interactions (e.g., ELISA assay). Unlike the ELISA assay, 
however, the current two4iybrid system is not readily transplantable to drug screening 

15 operations. A major problem with the existing two-hybrid methods is that nonsp^ific 
inhibitors of transcriptional activation score the same as inhibitors of the specific protein- 
protein interaction. 

Negative Regulators of Transcription 
To address the aforementioned problem, the read-out of the conventional two- 

20 hybrid interaction can be reversed by interposition of a relay gene which serves to invert Ae 
ouq)ut produced from mteraction of the two hybrid proteins firom a positive transcriptional 
activator to a negative transcriptional regulator (e.g., repressor). In order to invert the 
readout from a positive transcription activator to a negative transcription repressor, it is 
possible to take advantage of the properties of certain negative regulators of transcription. In 

25 an embodiment, some of these negative regulators block the function of specific transcriptional 
activators by binding to their artivation domain. Two such examples are the ntdm2 
oncoprotein which binds to and masks the trans-activation domain of the tumor suppressor 
protein p53 (Momand et al. (1993) Cell 69: 1237; Oliner et al. (1993) Nature 362 : 857), and 
the yeast Gal80 protein which binds and inactivates tiie transcriptional activator region of Gal4 

30 (Ma and Ptashne (1987) CeU 5Q: 137; Johnston and Carlson (1993) Regulation of Carbon and 
PhQsphate MetaMisn), vol. 2, Cold Spring Harbor Press, Cold Spring Harbor, New York). 
By designing the two-hybrid interaction to drive the expression of a negative regulator of a 
specific transcriptional activator, die resultant system is such that interference with the two- 
hybrid interaction results in increased activity of a transcriptional activator and hence a 
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positive signal. In view of the fact that the biology of the Gal80-Gal4 system is well 
understood in yeast, this pair of negative-regulators/transcriptional activators is suitable for the 
reverse two-hybrid method. In prmciple» the pair of mdm2 and p53 proteins, or any other 
matched pair of transcriptional activator and specific negative regulator, will also work. 
S In the present embodiment, the two interacting hybrid proteins reconstitute a 

transcriptional activator composed of a DNA binding domain derived from the bacterial 
protein encoded by lexA and an activator domam derived from the viral protein VP16 (Vojtek 
et al. (1993) op.cit.) . The reconstituted lexA/VP16 transcriptional activator binds to a relay 
gene operably linked to a transcriptional regulatory 

10 sequence containing tandem copies of a lexO binding site sequence which binds the lexA 
DNA-binding domain. Upon binding of ±t reconstituted lexA/VP16 transcriptional activator 
to die lexO binding site(s), the operably Imked relay gene (GAL80) is efficiently expressed. 
Thus» when the two hybrid proteins are associated (e.g., as a heterodimer or the like)^ the 
GAL80 relay gene is expressed and serves to repress expression of a rq>ort^ gene consttucL 

15 ■ 

The Gal80-Gal4 Svstem 
The Gal80-Gal4 system of regulatory proteins underlies the ability of yeast 
cells to respond to exogenously added galactose and specifically synthesize the enzymes 
needed to utilize it as a carbon/energy source (Johnston and Carlson (1993) o p.cit. . Unless 

20 galactose is present, the Gal80 protein binds and blocks the function of the transcriptional 
activator Gal4. In the absence of the GAL80 gene, the transcriptional activator function of 
Gal4 is not masked and hence expression of galactose-regulated genes no longer requires 
galactose for induction. In the reverse two-hybrid system, die host strain generally is 
functionally disrupted for the endogenous GAI^ genc^, but Gal80 protein is provided through 

25 a two-hybrid driven relay gene construa (see. Experimental Example, infrd) which is operably 
linked to a transcriptional regulatory sequence that binds a bacterial lexA DNA-binding 
domain present in the first hybrid protein. When the two-hybrid interaction is driving the 
expression of the relay gene product. Gal 80, the Gal4-induction of the reporter gene(s) is 
inhibited. When the two-hybrid interaction is blocked, the relay gene (GALSCf) expression will 

30 be turned off and the Gai4-d^endent transcriptional regulatory sequence operably linked to 
the reporter gene(s) is then able to drive expression of the reporter gene(s). 



Techniques for Fine-Tuninp Expression of the Relav Gene 
The sensitivity of this system can be modulated by adjusting the amount of 
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Gal4 or Gal80 protein. A host strain generally contains the wild-type GAL4 gene and hence 
contains very low levels of Gal4 when the yeast cells are cultured with carbon/energy sources 
such as raffinose (Johnston and Carlson (1993) op.cit.V If necessary, the level of Gal4 
protein can be decreased by at least five-fold by culturing the cells in glucose (Griggs and 
5 Johnston (1991) Proc. Natl. Ac ad. Sci. fUSA^ fig: 8597). Higher levels of Gal4 protein can 
be provided by transforming the strain witii a multicopy plasmid encoding Gal4 (Schultz et al. 
(1987). 

The amount and/or stability of the relay protein, GalSO, can also be adjusted. 
Preferably, the stabUity of the Gal80 protein is sufficient such that the addition of protein 

10 mteraction inhibitor agents generates a detectable readout of die reporter gene(s) within about 
six hours, or most usually within the time-frame of an overnight assay. For this td be a 
convenient assay ^proach, Gal80 activity preferably deteriorates at a r^id rate when active 
inhibitor agents are added and die two-hybrid system is inhibited. The half-life of Gal80 
proteins in yeast cells has not been rigorously defined in the art. If Gal80 has a short half- 

15 life, it is generally only necessary to vary the level of transcription of GAL80 by changing 
eidier copy number of the two-hybrid relay gene construct or by varymg the number of 
binding sites for the transcriptional activator (e.g., lexO operator sequences) in the 
transcriptional regulatory sequence of the relay gene construct. If GalSO has an mordinately 
long half-life, it is preferable to engmeer a chimeric GalSO protein with a shorter half-life. 
20 Successful engineering of long-lived protems to proteins with shorter half-lives has been 
achieved by addition of PEST sequences to DHFX (Loetscher et al. (1991) J. Biol. Qiem, 
2^:11213) or by forming /S-galactosidase variants witii different N-terminal residues by in 
vjvQ processing of ubiquitin-jS-galactosidase fusions (Varshavsky et al. (1989) Yeast Genetic 
Engineering, Barr, Brake, and Valenzuela (eds.), Butterworths, pp. 109-143). The latter 
25 method has been well characterized in yeast, such that Gal80 variants with half-lives ranging 
from 2 minutes to over 24 hours can be readily generated. 

The following examples are offered by way of example and not by way of 

lunitation. 

EXPERIMENTAL EXAMPLES 
30 Construction of the appropriate host veajtt str^ln^ 

Since the GAL80-GAL4 system is employed, the reporter genes in the yeast 
strain need to be operably linked to promoters that are responsive to Gal4. Reporter genes 
that have been operably linked to Gal4-responsive promoters were integrated into yeast strains 
(see Construction of Yeast Strains, infra). One of the reporter genes encodes iJ-galactosidase, 
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whose expression allows quantitative transcriptional read-out, if desired. It is possible to 
utilize other reporter genes operably linked to Gal4-responsive promoters, such as ones 
encoding alkaline phosphatase, that would also allow easy quantitation of transcriptional read- 
out. JEY8, JEYIO, and JEY12, three independent progenitors for the reverse-two hybrid host 
S strams, were derived by standard genetic methods from a cross between YM2170 (AIATa ura3 
his3 ade2 fys2 tyrl GAL4* galSOA LEU2:GALl'lacZ\ available from Dr. Mark Johnston, 
Washmgton University, St. Louis, MO) and YPB2 {MATa hzs3 ade2 leu2 ura3 fys2 trpl canl 
gcU4A galSOA LYS2::GAL1-HIS3 VRA3::GALl'ladZ) (Bartel et al. (1993) in Cellular 
Interactions in Development: A Practical Approach, HarUey DA (ed.) Oxford University 

10 Press, Oxford, UK, pp. 153). The progenitor strains {MATa his3 ade2 leu2 uraS tys2 trpl 
GAL4* galSOA LY52::GAL1'HIS3 URA3::GALl-lacZ) contain all the necessary rq)orter genes 
and have been tested for a functional GaI4 protein and rq>orter genes by analysis of galactose- 
induced expression of j3-galactosidase. To test m general wh^er re-mtroduction of Gal80 
protein negatively regulates Gal4 in the system, JEY8 was transformed with a high-cofy 

IS plasmid containing the wild-type GAL80 gene (pBM260; available from Mark JohnstqjVu . 
Washington Univmity, St. Louis, MO). Sufficient expression results in inhibition of ^the 
read-outs from the reporter genes (HIS3 and lacZ)^ which are determined by assaying 
galactosidase activi^ and growth m die absence of histidine. Both of these reporter aptivities 
are scored in yeast grown on plates containing rafBnose (which allows for full acdvityiof 

20 GalSO protein) and galactose (which inactivates Gal80 protein). These tests confirm that in 
these strains the GalSO protem, expressed off its endogenous promoter, suppresses GaI4 
function. Two4iybrid constructs are evaluated for their ability to drive sufficient GALSO 
expression from the Lex(XjAL&) fusion plasmids that are constructed (see, infra), 

25 Construction of the LexO-GAL80 fusion genes 

A chimeric gene (a LexO-GAL80 fusion) is constructed and serves as the relay 
(signal inverter) gene. The DNA-binding domain of the transcriptional activator that is used 
to drive expression of the relay gene is derived from the bacterial protein encoded by lexA and 
has been used before in two-hybrid systems as a fusion with the transcriptional activator from 

30 the viral protein VP16 (Vojtek et al. (1993) op.cit. . Other transcriptional activators that have 
a defined DNA binding site, such as the ACEl gene product of S. cerevisiae (Munder and 
Furst (1992) MoL Cell. Biol. 12: 2091) may be used. The LexO sites are generated by 
mutually primed synthesis (geg. Chapter 8,2A in Current Protocols in Molecular Biology 
(1990) Ausubel, Brent, Kingston, Moore, Seidman, Smith, and Struhl (eds.), Greene 
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Publishing Associates and WUey Interscience, New York, NY) using the oligomer 
5'- 

GCGAATTCCTACTGTATATACATACAGTACCATCTACTGTATATACATACAGT^ 
CGCTCGAGCGGC-3* [SEQ. ID N0:1]. The resulting fragment contains four consensus 
S LexA binding sites in tandem. The DNA product is digested with EcoRI and inserted into the 
EcoRI site of pCZD (Lue et al. (1989) Proc. Natl. Acad. Sci. OJSA^ 8fi: 486) to generate 
pCZD-LexO. The pCZD vector contains a minimal TATA box for recognition of the basal 
transcriptional machinery but requires the addition of specific DNA sequences to effectively 
function as promote box. The Gal80 coding sequence is isolated by PCR using the following 

10 two oligomers: 5'-CGCGGATCCCGTTCnTCCAC TCCCG-3' [SEQ. ID NO:2]; and 

5*-CGGATCCGATGGAAGGATGCCCGCrGCTGC-3' [SEQ. ID N0:3]. TTie template is the 
plasmid pBM260 which contains the GAL80 gene subcloned in YEpl3 (available from Mark 
Johnston, Washington University, St Louis, MO). The GAL80 PC31 product is digested with 
BamHI and inserted into the BamHI site of pCZD-LexO to create pLexO-Gal80. The LexO- 

15 Gal80 fusion is then subclpned into pCfalileo, a 2^ based yeast shuttle vector (20-30 copies per 
cell) carying the ADE2 selectable mark^ (available from Avtar Roopra, Washington 
University, St Louis, MO) to generate pJE42. From this plasmid, CEN- and integrating 
versions are constructed to provide a means of controlling the level of expression of GAL80 
by the two-hybrid interaction. For example, the basal transcription from the 2ft plasmid may 

20 express sufficient Gal80 to require galactose for expression of the reporter gene(s) even in the 
absence of a lexA-based transcriptional activator. Additionally, LexO-ubiquitin-Gal80 fusions 
encoding a shortened half-life Gal80 protein is constructed. 

In order to demonstrate that relay gene constructs comprising a LexO-GALSO 
polynucleotide fusion can be activated by the two-hybrid interaction to sufficient levels for 

25 regulating the Gal4-mediated reporter gaie expression, a positive control is generated. Yeast 
are transformed with a plasmid that contains a fusion of the DNA binding domain QexA) and 
transcriptional activation component (VP 16) of the two-hybrid system and activates 
transcription of the relay gene LexO-GAL80 fusion. The plasmid pLEX-VP16 (available from 
A. Vojtek; Vojtek et al. (1993) op.cit^ is used for the positive control. The ability of two- 

30 hybrid interactions to drive expression of the relay gene is demonstrated. 



Testing for the abilitv of two-hvbrid interactions to dr ive expression of the LexQ-GAL8Q 
fusions 

The two-hybrid interaction that is used to test for its ability to drive sufficient 
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expression of the LexO-GALSO relay gene is the interaction of human H-ras p21 with human 
c-Raf (Van Aelst et al. (1993) Proc. Natl. Ac ad. Sci. (USA) 90: 6213; Vojtek et ai. (1993) 
op cit.). K-ras is linked by m-frame polynucleotide fusion to the VP16 activation domain, and 
Raf is linked by in-frame polynucleotide fusion to the DNA bmding domain of the lexA gene 
5 product. pGBT8-Raf was constructed by ligating EcoRI and PstI linkers to a Raf coding 
sequence isolated by PGR amplification of a human placental cDNA library from Stratagene 
(San Di^o, CA) as described by MacDonald et al. (1993) Mol. Cell. Biol. H: 6615. The 
iZa/gene was cut out of pGBT8*Raf as an EcoRI to PstI fragment and subcloned into the 
EcoRI-PstI site of pBTMl 16 (that contains the LexA DNA binduig domain (Vojtek et al. 

10 (1993) fi]2^£iL) to generate pBTM-Raf (pJEi6). The EcoRI site maintaina the same reading 
frame. p(JBT8K-ras was constructed by PGR amplification of pEXV-K-ras (Hancock et al. 
(1990) ^: 133) sudi that the K-ras sequence is isolated as a Sall-PstI restriction fragment 
which was then subcloned into Sall-Pstl-cut pGBT8. To construct pVPK-ras OpJE44), a PCR 
product of pGBT8K-ras was generated usmg the following oligomers as PGR amplimers: 

15 5*-GGGGATGCATGAGTGAATATAAACTTGTGGTAG-3' (SEQ. ID NO:41 

5*-CGGGATCCTrACATAATTAGAGAGTTTGTGTTTGACrTG-3' [SEQ. ID NO:5] 
and the resultant PGR product was digested with BamHI and subcloned into the BamHI site of 
pVPlS (Vojtek et al. (1993) op.cit.^ to generate pVPK-ras (pJE44). The LexO-GAL80 relay 
gene plasmid, the pBTM-Raf and the pVP-K-ras (pJE44) plasmids are cotransfected into a 

20 host yeast strain and the ability of the two^ybrid interaction to drive sufficient expression of 
GAL80 to prevent the expression of the reporter genes (tocZ and HISS) is determined. 
Growth on galactose is used as an internal positive control to ensure that the promoter is still 
functional. 

25 Testing for the ability of a small molecule to inter fere with a two-hvbrid interaction 

The reverse two-hybrid method is used as a screening assay for identifying 
small molecule inhibitors of protein-protein interaaion, such that an exogenously added small 
molecule can interfere with a two-hybrid interaction. In one example, a reverse two-hybrid 
system utilizes the small molecule estradiol as the protein interaction inhibitor. Estradiol is a 

30 small lipophilic molecule that has been shown to be effective in yeast. It has been shown that 
estradiol reverses the interaction of the hormone binding domain of the estrogen receptor with 
the heat-shock protein HSP90. Thus, a first hybrid protein comprising the hormone binding 
domain of the estrogen receptor in polypeptide linkage to a lexA DNA-binding domain and a 
second hybrid protem comprising the heat shock protein, HSP90, in polypeptide Imkage to the 
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VP16 activation domain are constructed by standard methods. Polynucleotide(s) encoding the 
first and second hybrid proteins, a LexO-GALSO relay gene construct, and a Gal4-<iependent 
reporter gene construct are introduced into the yeast host. Estrogen (e.g., estradiol) is 
evaluated as an agent for inhibiting formation of a functional two-hybrid heterodimer and 
5 thereby producing expression of the reporter gene. 

Testing for the ability of a polvpentide to interfere with a two-hvbrid interaction 

The reverse two-hybrid method is used as a screening assay for identifying 
polypeptide inhibitors of protein-^ rotem interaction, such that an mtracellularly expressed 

10 polypeptide can interfere with a two-hybrid interaction. In one embodiment, a reverse two- 
hybrid system utilizes a polypq>tide e^qiressed from a cotransfected cDNA e;q)ression 
construct as the protein interaction mhibitor. 

A first hybrid protein comprising a first interacting polypeptide sequoice in 
polypq}tide linkage to a lexA DNA-bmding domain and a second hybrid protein comprising a 

IS second interacting polypeptide sequence in polypeptide linkage to the VP16 activ^on domain 
are constructed by standard methods. Polynucleotide(s) encoding the first and second hybrid 
proteins, a Lex(XrAL80 relay gene construct, and a Gal4-dependent reporter gene construct 
are introduced into the yeast host. A polynucleotide encoding and expressing a polypeptide 
typically between 5 and 500 amino acids long (e.g., a library member of a cDNA e;q}ression 

20 library) is also introduced into the yeast cells under conditions wherein the encoded 

polypq>tide is expressed intracellularly. The expressed polypeptide is evaluated as an agent 
for inhibiting formation of a functional two-hybrid heterodimer and thereby producing 
expression of the reporter gene. 

Essentially any of various expression clone libraries known in the art may be 

25 used, including conunercially available expression libraries (Clontech, Inc., Palo Alto, CA). 
Expression clone libraries may also be generated by the practitioner by conventional cloning 
methods and vectors known in the art (e.g., pcD, pSV), especially yeast expression vectors. 
Expression clone libraries comprise a collection of library members, each member comprising 
a cloned polynucleotide sequence (which may comprise mutation(s) or deletions), typically a 

30 cDNA sequence, operably linked to a promoter (and optionally an enhancer) which is 

transcriptionally active in the host cell so that the cloned sequence is transcribed and translated 
into a polypeptide. Genomic DNA sequences (e.g., complete structural genes or fragments 
thereof) may also serve as cloned sequences in expression libraries. Preferably, the cloned 
sequence is inserted in cloning site which facilitates the recovery of the cloned sequence free 
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from the promoter and other sequences which comprise an expression cassette. 

Expression clone library members are transferred into host cells by various 
means, including but not limited to: electroporation, lipofection, viral vector transduction, 
biolistics, and CaP04 precipitation. Expression clone library members may be transferred 
S directly mto host cells, or a relay and/or rq)orter polynucleotide and/or polynucleotide(s) 
encoding the first and second hybrid proteins may be co-transf^ed with expression clone 
library members into a host cell, or a relay and/or reporter polynucleotide and/or 
polynucleotide(s) encoding the first and second hybrid proteins may be transferred into host 
cells subsequent to transfer of expression clone library members. 

10 Cloned polynucleotides can be recovered from expression clone library 

members which are isolated by the screening methods of the invention. Typically, cloned 
sequences are excised by restriction digestion with an enzyme(s) which cleave at the 
boundaries b^een the ends of the cloned sequence (e.g., cDNA) and the remainder of the 
expression clone library mraber. Alternatively, PGR (preferably high-fideliQr PCR) or other 

IS amplificadon method (e.g., LCR) may be performed using primers whidi flank the site at 
whidi the cloned sequence rs mserted in the library member to amplify and thereby isolate the 
cloned sequence (U.S. Patrat 4,683,202, incorporated herein by reference). When PCR is 
used, it is generally preferable to incorporate known unique polynucleotide sequences flanking 
at least one, and preferably both, side(s) of the site in which a cloned sequence is inserted to 

20 facUitate recovery of the selected cloned sequence(s). 

Although the present inv^on has been described in some detail by way of 
illustration for purposes of clarity of understanding, it will be apparent that certain changes 
and modiflcations may be practiced within the scope of the claims. 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: ERICKSON, JAMES R. 
POWERS, SCOTT 

(ii) TITLE OF INVENTION: REVERSE TWO-HYBRID METHOD 

10 

(iii) NUMBER OF SEQUENCES: 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Cooley Godward Castro Huddleson & Tatum 
15 (B) STREET: Five Palo Alto Square, Fourth Hoor 

(C) CITY: Palo Alto 

(D) STATE: California 

(E) COUNTRY: US 

(F) ZIP: 94306-2155 

20 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Roppy disk 

(B) COMPUTER: IBM PC compatible 

(Q OPERATING SYSTEM: PC-DOS/MS-DOS 
25 (D) SOFTWARE: Patentln Release #1 .0, Version #1 .25 

(vi) CURRENT APPUCATION DATA: 

(A) APPLICATION NUMBER: US 08/218,933 

(B) FILING DATE: 29^MAR-1994 
30 (C) CLASSIHCATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Torchia, Timothy E. 

(B) REGISTRATION NUMBER: 36,700 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 843-5481 

(B) TELEFAX: (415) 857-0663 

5 

(2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 68 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 

(iu) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

20 

(ix) FEATURE: 

(A) NAME/KEY: niisc_feature 

(B) LOCATION: 1..68 

(D) OTHER INFORMATION: /standard_name= "PCR primer" 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

GCGAATTCCT ACT6TATATA CATACAGTAC CATCTACTGT ATATACATAC AGTAOCCGCT 60 

30 

CGAGCGGC 68 



35 
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(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
5 (B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

15 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..26 

(D) OTHER INFORMATION: /standard_name= "PCR primer* 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
CGCGGATCCC GTTCTTTCCA CTCCCG 

25 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(ui) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

5 

(ix) FEATURE: 

(A) NAME/KEY: inisc_feature 

(B) LOCATION: 1..30 

(D) OTHER INFORMATION: /standard_name= "PCR primer" 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
CGGATCCGAT GGAAGGATGC CCGCTGCTGC 30 

15 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

25 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

30 

(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION: I.. 33 

(D) OTHER INFORMATION: /standard_name= "oligomer for PCR" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
CGGGATCCAT GACTGAATAT AAACTTGTGG TAG 33 

5 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 base pairs 
10 (B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

20 

(ix) FEATURE: 

(A) NAME/KEY: niisc_feature 

(B) LOCATION: 1..39 

(D) OTHER INFORMATION: /standard_name= "oligomer for PCR" 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 
CGGGATCCTT ACATAATTAC ACACTTTGTC TTTCACTTG 39 

30 
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1, A reverse two-hybrid system comprising: 

(1) a first hybrid protein conq)rising a first interacting polypeptide sequence in 
5 polypeptide linkage to a DNA-binding domain of a transcriptional activator; 

(2) a second hybrid protein comprising a second interacting polypeptide 
sequence in polypeptide linkage to an activation domain of a transcriptional activator, whefein 
the second hybrid protein bmds to the first hybrid protein via contact of the interacting 
polynucleotide sequences under physiological conditions; 

10 (3) a relay gene whose transcription is dependent upon the first hybrid protein 

and the second hybrid protein being bound to each other, therd>y reconstituting a 
transcriptional activator; and 

(4) a r^rter gene whose transcription is rq>ressed by egression of the relay 
gene and which is substantially transcribed in the absence of relay gene expression. 

15 :5 

2. A reverse two-hybrid system of claim 1, wherein the two-hybrid 
system is in a yeast host cell. 



3. A reverse two-hybrid system of claim 1, wherem the first hybrid 
20 protein, the second hybrid protein, a relay protein, and a reporter protein are encoded on at 

least one polynucleotide and said at least one polynucleotide is introduced into a yeast host 
cell. 

4. A reverse two-hybrid system of claim 1, wherein the first 
25 hybrid protein comprises a lexA DNA-binding domain in polypeptide linkage to the first 

interacting polypeptide sequence; 

the second hybrid protein comprises a VP 16 activation domain in polypeptide 
linkage to the second interacting polypeptide sequence; 

the relay gene encodes Gal80 operably linked to a LexO sequence in a cis- 
30 linked relay gene transcription regulatory sequence; and 

the reporter gene comprises lacZ or HIS3 and is operably linked to a 
transcription regulatory sequence which confers Gal4-dependent transcription to cis-linked 
adjacent polynucleotide sequences. 
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5. A reverse two-hybrid system of claim 4 in a yeast cell produced by 
crossing a Saccharomvces organism having the genotype MATa his3 adel leu2 ura3 fys2 trpl 
GAU* galSOD LYS2::GAL1'HIS3 URA3::GALl'lacZ, 

5 6. A reverse two-hybrid system of claim 4 further comprising an agent. 

7. A reverse two-hybrid system of claim 4 further comprises an 
expression clone library member which expresses an intracellular polypeptide in the yeast 
host. 

10 

8. A yeast cell con^rising a reverse two-hybrid system of claim 1 or 4. 

9. A polynucleotide encoding a GalSO polypeptide and conq>rising at least 
one operably linked LexO binding site. 

15 

10. A polynucleotide of claun 9 in a yeast cell which contains a 
functionally disrupted endogenous GALSO gene. 



11. A yeast cell containing: 
20 a polynucleotide sequence encoding a first hybrid protein which is 

constitutively expressed; 

a polynucleotide sequence encoding a second hybrid protein which is 
constitutively expressed; 

a polynucleotide sequence encoding a relay protem whose expression is 
25 dependent upon mtermolecular binding of the first hybrid protein with the second hybrid 
protein under physiological conditions; and 

a polynucleotide sequence encoding a reporter protein whose expression is 
repressed by the relay protein. 

30 ^ 12. A yeast cell of claim 1 1 , further comprising an expression clone 

library member which expresses a polypeptide encoded by a cDNA. 



13. A yeast cell of cl^im 11, further comprising an agent having a 
molecular weight of less than 1,000 daltons. 
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14. A method for identifying agents which inhibit intemiolecular binding 
under physiological conditions between a first interacting polypeptide sequence and a second 
interacting polypeptide sequence, said method con5)rising the steps of: 

administering an agent to a host cell containing a reverse two-hybrid system of 
5 claim 1 or 4 and incubating the host ceU for a suitable period; 

determining whether the administration of the agent induces a substantially 
expression of the reporter gene; and 

identifying an agent which induces a substantially expression of the reporter 
gene as a protein interaction inhibitor. 

10 

15. A method of claim 14, wherein the agent is a molecule having a 
molecular weight less than about 1 ,000 daltons. 

16. A kit conq)rising a reverse two-hybrid system, a host cell, an 
IS instruction manual, and optionally a panel of agents for testing. 



17. A reverse two-hybrid system of claun 4, wherein the first interacting 
polypeptide sequence is a mammalian ras polypeptide and the second mteractmg polypeptide 
sequence is a Raf polypqnide. 

20 
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